An Efficient Enhancement of Mining Top-K Association Rule
نویسندگان
چکیده
Data mining on huge databases has been a major issue in research area, due to the problem of analyzing large volumes of data using traditional OLAP tools only. This type of process implies much computational power, disk I/O and memory, which can be used only by parallel computers. So, depending on the selection of the parameters (the minimum support and minimum confidence), current algorithms can be slow and produce an extremely large number of results or produce very less results, omitting useful information. This is really a major problem because in practice users don’t have much resource for analyzing the results and have to discover a certain amount of results in a limited time. To address this problem, we propose a unique technique to mine top ranked data from a data set. The algorithm uses a new method for generating association rules. The algorithm has unique and best performance and feature of scalability, which is a beneficial alternative to classical Association rule mining algorithms when the user wants to control the number of rules generated. Keyword: Association Rule Mining, Data mining, Association rule learning, Top-k rules, Confidence
منابع مشابه
A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
متن کاملAssociation rule mining application to diagnose smart power distribution system outage root cause
Smart grid has been introduced to address power distribution system challenges. In conventional power distribution systems, when a power outage happens, the maintenance team tries to find the outage cause and mitigate it. After this, some information is documented in a dataset called the outage dataset. If the team can estimate the outage cause before searching for it, the restoration time will...
متن کاملA Quick Method for Querying Top-k Rules from Class Association Rule Set
Finding class association rules (CARs) is one of the most important research topics in data mining and knowledge discovery, with numerous applications in many fields. However, existing techniques usually generate an extremely large number of results, which makes analysis difficult. In many applications, experts are interested in only the most relevant results. Therefore, we propose a method for...
متن کاملEfficient Mining Top-k Regular-Frequent Itemset Using Compressed Tidsets
Association rule discovery based on support-confidence framework is an important task in data mining. However, the occurrence frequency (support) of a pattern (itemset) may not be a sufficient criterion for discovering interesting patterns. Temporal regularity, which can be a trace of behavior, with frequency behavior can be revealed as an important key in several applications. A pattern can be...
متن کاملEfficient Mining of Top Correlated Patterns Based on Null-Invariant Measures
Mining strong correlations from transactional databases often leads to more meaningful results than mining association rules. In such mining, null (transaction)-invariance is an important property of the correlation measures. Unfortunately, some useful null-invariant measures such as Kulczynski and Cosine, which can discover correlations even for the very unbalanced cases, lack the (anti)-monot...
متن کامل